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Abstract 

The problem of simulating sparse Hamiltonians on quantum computers is well studied. The 
evolution of a sparse NxN Hamiltonian H for time t can be simulated using 0(||fft|| poly (log TV)) 
operations, which is essentially optimal due to a no-fast-forwarding theorem. Here, we consider 
non-sparse Hamiltonians and show significant limitations on their simulation. We generalize the 
no-fast-forwarding theorem to dense Hamiltonians, ruling out generic simulations taking time 
o(||iJi|j), even though is not a unique measure of the size of a dense Hamiltonian H. We 
also present a stronger limitation ruling out the possibility of generic simulations taking time 
poly(||-ffi||,logiV), showing that known simulations based on discrete-time quantum walk can- 
not be dramatically improved in general. On the positive side, we show that some non-sparse 
Hamiltonians can be simulated efficiently, such as those with graphs of small arboricity. 

1 Introduction 

One of the primary applications of quantum computers is the simulation of quantum systems. 
Indeed, it was the apparent exponential time complexity of simulating quantum systems on a 
classical computer that led Feynman to propose the idea of quantum computation [14]. 

In addition to predicting the behavior of physical systems, Hamiltonian simulation has algorith- 
mic applications. For example, the implementation of a continuous-time quantum walk algorithm 
is a Hamiltonian simulation problem. Examples of algorithms that can be implemented using 
Hamiltonian simulation methods include unstructured search [13], adiabatic optimization [12], a 
quantum walk with exponential speedup over classical computation [8] , and the recent NAND tree 
evaluation algorithm [10]. 

In the Hamiltonian simulation problem, our goal is to implement the unitary operator e~^^* for 
some given Hamiltonian H and time t. We say that a Hamiltonian H acting on an AT-dimensional 
quantum system can be simulated efficiently if there is a quantum circuit using poly(log A^, t, 1/e) 
one- and two-qubit gates that approximates (with error at most e) the evolution according to H 
for time t. (Of course, we can rescale t by rescaling H, so the complexity of simulating H for time 
t must also depend on some measure of the size of H, as discussed in more detail below.) 

Efficient simulations are known for various classes of Hamiltonians. For example, a Hamiltonian 
for a system of qubits can be simulated efficiently whenever it is local, meaning that it is a sum of 
terms, each of which acts on a constant number of qubits [18]. More generally, a Hamiltonian H 
can be simulated efficiently if it is sparse (i.e., has only poly (log A) nonzero entries per row) and 
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efficiently row- computable (i.e., there is an efficient means of computing the indices and the matrix 
elements of the nonzero entries in any given row) [1]. 

These conditions lead to a convenient black-box formulation of the problem, in which a black 
box can be queried with a row index j and an index i to obtain the i^^ nonzero entry in the j^^ 
row. This black box can be implemented efficiently provided that H is efficiently row-computable. 
A series of results has decreased the number of black-box queries, in terms of A^, from the original 
0(log^ N) [1], to 0(log^ N) [6], to 0(log* A^) [4]. In particular, Berry, Ahokas, Cleve, and Sanders [4] 
present an almost linear-time algorithm for simulating sparse Hamiltonians with query complexity 



where d is the maximum number of nonzero entries in any row and e is the maximum error permitted 
in the final state (quantified in terms of trace distance). 

The dependence of (1) on the simulation time is nearly optimal, since it is not possible to 
simulate a general sparse Hamiltonian for time t using o(t) queries. Intuitively, there is no generic 
way to fast-forward through the time evolution of quantum systems. More formally. 

Theorem 1 (No-fast-forwarding theorem [4, Theorem 3]). For any positive integer N there exists 
a row- computable sparse Hamiltonian H with \\H\\ = 1 such that simulating the evolution of H for 
time t = 'kN/2 within precision 1/4 requires at least N/A queries to H . 

More recently, methods have been presented for simulating a Hamiltonian H that is not nec- 
essarily sparse. Of course, we do not expect to efficiently simulate a general Hamiltonian, simply 
because there are too many Hamiltonians to consider (just as we cannot hope to efficiently imple- 
ment a general unitary operation [17]). However, we can conceivably efficiently simulate non-sparse 
Hamiltonians with a suitable concise description. In particular, by applying phase estimation to a 
discrete-time quantum walk derived from one can simulate H for time t in a number of walk 
steps that grows only linearly with t [7]. More precisely, we have 

Theorem 2. For any Hermitian matrix H , there is a discrete-time quantum walk on the graph of 
nonzero entries of H such that e~^^^ can he simulated with error at most 5 using 0{\\ahs{Ht)\\/ ^) 
steps of the walk, where ahs{H) is the matrix with entries abs{H)jk = \Hjk\. 

Of course, to apply this result, we must implement the discrete-time quantum walk derived 
from H. This can be done efficiently for various concisely specified non-sparse Hamiltonians [7]. 
Note that the same theorem holds with ||abs(-ff)|| replaced by ||i^||i (a matrix norm defined in 
Section 2); this quantity is generally larger than ||abs(//)||, but the resulting walk may be easier to 
implement. 

Notice that the overhead of this simulation is proportional not to the spectral norm \\H\\, but 
to a measure of the size of H that can be much larger when some entries of H are negative (or 
more generally, complex). This naturally raises the question of whether an improved simulation 
is possible. In the present article, we examine this possibility. Unfortunately, our main result is 
negative: there is no general Hamiltonian simulation algorithm that uses only poly(||-fft||, log A^) 
steps (Theorem 4). 

The remainder of this article is organized as follows. In Section 2, we introduce various matrix 
norms that arise when quantifying the complexity of Hamiltonian simulation and relate them to 
one another. We then move on to lower bounds for non-sparse Hamiltonians in Section 3, where we 
describe how the no-fast-forwarding theorem can be modified to give a lower bound that depends 
on the spectral norm rather than various smaller measures of the size of a Hamiltonian. Then, in 
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Section 4, we present the main result, an example of a family of Hamiltonians with ||abs(i7) || S> \\H\\ 
that cannot be simulated in time poly (||-fft||, log N). We then turn to upper bounds in Section 5, and 
investigate how certain structured Hamiltonians can be simulated in time 0(||i/t||) — in particular, 
we give a positive result on the simulation of Hamiltonians whose graphs have small arboricity.^ 
Finally, we conclude in Section 6 with a discussion of open problems. 



2 Measures of simulation complexity 

Upper and lower bounds on the complexity of simulating a Hamiltonian H depend on some measure 
of the size of H. Since e~'^* depends only on the product Ht, the complexity of simulating H 
for time t is some function of Ht. For example, the no-fast-forwarding theorem clearly cannot be 
circumvented by simply multiplying H hy a constant. Similarly, simulation results such as those for 
sparse Hamiltonians, using operations, and Theorem 2, using 0(||abs(//t)||) operations, 

depend on various measures of the size of Ht. 

In this section, we take a step back and consider properties of various measures of the size of 
H that may play a role in the complexity of simulating it. Let v{Ht) be a function that measures 
the complexity of simulating H for time t. We can infer various properties of u{-) as follows. Since 
it is trivial to simulate the identity operation, z/(0) = 0. On the other hand, if 7^ 0, then it 
requires some work to simulate, so ^{H) > 0. It is also plausible to suppose that v{tH) = \t\i'{H). 
We clearly have v{Ht) < \t\v{H) for i S Z, since Ht can be simulated using t exact simulations of 
H. On the other hand, the no-fast-forwarding theorem suggests that this is the best possible way 
to simulate Ht in general. Finally, since the Lie product formula can be used to simulate H + K 
using simulations of H and -fC, we expect that v{H + K) ^ v{H) + ^{K) (up to the fact that a 
bounded-error simulation requires a slightly superlinear number of operations). 

These properties are reminiscent of the axioms for matrix norms, suggesting that it may be 
reasonable to quantify the complexity of simulating H in terms of some matrix norm i'{H). Indeed, 
results on the simulation of sparse Hamiltonians are typically stated in terms of the spectral norm 
and Theorem 2 also involves matrix norms. We now introduce various matrix norms relevant 
to Hamiltonian simulation. 

Definition 1 (Spectral norm). The spectral norm of a matrix H is defined as 

\\Hv\\ 

\\H\\ := max— n — n~ = niax (2) 

v^o \\v\\ \\v\\=l 



where \\v\\ is the standard Euclidean vector norm defined as ||f || := l^iP- 

The spectral norm, also known as the operator norm or induced Euclidean norm, is equal to the 
largest singular value of the matrix. For Hermitian matrices it is also equal to the magnitude of the 
largest eigenvalue. This norm arises in the complexity of sparse Hamiltonian simulation algorithms, 
and in Theorem 2 as the spectral norm of abs(i/), the matrix with entries abs[H)jk = \Hjk\. 

Definition 2 (Induced 1-norm). The induced 1-norm of a matrix H is defined as 

lli^lli := max — -iii = max^^ Iff,-,!, (3) 

" v^o \\v\\i j ^' ^ ' 

i 

where ||u||i is the vector 1-norm defined as ||f ||i := \ vi\. 



graph is said to have arboricity k if its adjacency matrix can be written as the sum the adjacency matrices of 
k forests, but not k — 1 forests. 
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The induced 1-norm is equal to the maximum absolute column sum of the matrix. As mentioned 
in Section 1, Theorem 2 holds with ||abs(i/)|| replaced by This does not, however, lead to a 

superior simulation method since ||-ff||i > ||abs(-ff)||, as shown in Lemma 1 below. 

Definition 3 (Maximum column norm). The maximum column norm of a matrix H is defined as 



mcn(i7) := max 



max — — - — 

k; 1 



max \ \He 



J II' 



(4) 



where ej is the j^^ column of the identity matrix. 

The maximum column norm is the maximum Euclidean norm of the columns of H. This 
norm appears in the complexity of an algorithm for simulating Hamiltonians whose graphs are 
trees [7, Theorem 4] and in the related Proposition 2 in Section 5. 



Definition 4 (Max norm). The max norm of a matrix H is defined as 

inax(H) := max\Hij\. 



(5) 



The max norm is just the largest entry of H in absolute value. It is a matrix norm, and is 
typically much smaller than the other norms mentioned. 

The following lemma relates the various norms introduced above. 



Lemma 1. For any Hermitian matrix H G C^^^ , we have the following inequalities: 

max(if) < mcn{H) < \\H\\ < ||abs(if)|| < ||if||i < ViV mcn(if) < iVmax(i7). 
Furthermore, each of these inequalities is the best possible. 



(6) 



Proof. The first inequality follows from the fact that the maximum element in any column cannot 
be greater than the Euclidean norm of that column. We have 



max 



(H) = max ^max l-ffyl^ < max l-^^ijP = mcn(if). 



(7) 



The next inequality follows from the observation that mcn(H) is defined by a maximum over 
the standard basis vectors ej, whereas \\H\\ is defined by a maximum over all vectors with norm 1, 



which contains the set of all ej . Thus 



mcn(H) = max\\HeA\ < max \\Hv\\ = \\H\\. 
Using the triangle inequality with \\H\\ = iiiax||^||=i(^,- | ^ • HijVj\'^)^ , we get 



(8) 



\H\\ < max f V y^ \H, 



Now by maximizing over all v with ||z;|| = 1 instead of only those with vj > 0, we get 



(9) 



max y ahs{H)ijVj j < max ^ ahs{H) 



\\ahs{H)\\. (10) 
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The last inequality is actually an equality due to the Perron-Frobenius theorem. 

Since abs(iJ) is a symmetric matrix, there is an eigenvector z with eigenvalue equal in magni- 
tude to ||abs(iJ)||. Clearly this eigenvector satisfies ||abs(ff)2;||i = ||abs(i?)||||z||i. Using this and 
maximizing over all nonzero vectors, we have 

||abs(i/)|| = ""^y]""""^ = < max^M^ = ||abs(/7)|K. (11) 

ll-^lli ll-^lli IpIIi 

The inequality now follows from the fact that ||-ff||i = ||abs(-ff) ||i, since 

||abs(i7)||i = max > | abs(i?)ij| = max > = HffHi. (12) 



For the next inequality, we use the fact that ||f||i < V-ZVUwH for all vectors v. This can be 
proved using the Cauchy-Schwarz inequality, u)| < ||ii||||7;||, by taking Ui = Vil\vi\. Let jmax 
be the index j that maximizes l-ffijl- Thus ||-ff||i = l-f^iimaxl = ll-^^imaxlli- Using these two 
inequalities, it follows that 

||if||i = ll/^Cj^.^Jli < \/iV||i/ej„_|| < ViVmax||i/ej|| = ^mcn{H). (13) 

j 

The last inequality is proved using the fact that for any j, Hij < maxj Hij; thus 

mcn(H) = max [S^ l-f^ijP ^ max /N max | Hij \ ^ = \/iV max \ Hij\ = y/N max(if). (14) 

j \l ^ j V i ij 

For each of these inequalities, there is a matrix that achieves equality. The first four inequalities 
are saturated when H is the identity matrix since the relevant norms are all equal to 1. The last 
two inequalities are satisfied with equality when H is the all-ones matrix (i.e., for all i,j, Hij = 1), 
since then = N, mcn{H) = y/N, and max(i^) = 1. □ 

Since Theorem 2 involves ||abs(-ff) ||, we would like to relate ||abs(i/)|| and ||i^||. Lemma 1 
gives ||abs(i/)|| < -v/iV||-ff||, which is also the best possible inequality between the two norms. For 
example, when is a power of 2, the matrix H = R^^°s^ achieves equality, where R ■= ( } _\ ) /\/2 
is the Hadamard matrix. It has \\H\\ = 1, but ||abs(-?/)|| = y/N. This shows that Theorem 2 
might not be as powerful as we would like, since for some Hamiltonians, the simulation method of 
Theorem 2 may be infeasible even when \\H\\ is small. 

Although the above inequalities cannot be tightened in general, there can of course be stronger 
relationships among the various norms for special classes of Hamiltonians. For example, if H is 
sparse, observe that the norms mentioned above can differ at most by a factor of poly (log A^). 
Specifically, if H is fc-sparse (i.e., it has at most k nonzero entries per row), then 

max{H) < mcn{H) < \\H\\ < ||abs(i?)|| < \\H\\i < Vkmcn{H) < kmax{H). (15) 

The first four inequalities are from Lemma 1. The inequality \\H\\i < Vkmcii{H) follows from 
(13) using the fact that < when v has at most k non-zero entries (this can be proved 

using the Cauchy-Schwarz inequality as before). The last inequality follows from (14) and the 
inequality \Hij\'^ < A;maxj \ Hij\'^, which holds when H is A;-sparse. Thus the choice of norm 
for sparse matrices is quite flexible, since all the above-mentioned norms are equivalent up to 
polynomial factors. 

In Section 5, we discuss some more examples in which Lemma 1 can be strengthened, with 
emphasis on the implications for simulations. 
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3 A no— fast-forwarding theorem for dense Hamiltonians 



The no-fast-forwarding theorem (Theorem 1 above) estabhshes a lower bound for the simulation of 
sparse Hamiltonians. Although we stated the theorem with \\H\\ = 1, any of the norms in Lemma 1 
could have been used, since the Hamiltonian used in the proof of the no-fast-forwarding theorem 
is 2-sparse, and by (15) the norms differ at most by a factor of 2. In particular, the theorem could 
be restated with max(if) < 1 or ||i^||i < 2. 

Since the choice of norm is unclear, it is conceivable that there are Hamiltonian simulation 
algorithms that run in time 0{max{Ht)) or 0{mcn(Ht)). To distinguish between the norms, we 
require a dense Hamiltonian. The aim of this section is use the proof techniques of Theorem 1 to 
establish a similar theorem for dense Hamiltonians. In particular, we show that there does not exist 
an algorithm for simulating dense Hamiltonians in time 0(max(i?t)) or 0{mcn{Ht)). However, 
this does not appear to rule out poly(||iJi||) simulations, which we rule out in the next section. 
Although Theorem 4 in the next section is stronger than Theorem 3 below, we briefly present 
this straightforward generalization of the no-fast-forwarding theorem to show the extent of that 
approach as applied to the non-sparse case. 

As in Theorem 1, we consider a black-box formulation of the problem of simulating dense 
Hamiltonians. There is a black box that can be queried with a row index j, which outputs the 
entire j^^ row. Since the Hamiltonian is dense, this output can be exponentially large. This is not a 
problem, however, since our goal is to find a lower bound on query complexity, not time complexity. 
Even though each query takes exponential space, it counts as only one query. The black box used 
here is more powerful than the one in Theorem 1, so the lower bound proved below also carries 
over to the black box used in Theorem 1. 

In terms of this black-box model, we have the following: 

Theorem 3. For any positive integer N , there exists a non-sparse Hamiltonian H such that sim- 
ulating the evolution of H for time t = ttN/2 within precision 1/4 requires at least N/A queries to 
H. This Hamiltonian has \\H\\ = 1, mcn{H) = 0(l/\/]V), and m.ax{H) = @{1/N). 

Proof. The main idea, as in the proof of Theorem 1 [4], is to construct a Hamiltonian whose 
simulation for time t = TrN/2 determines the parity of bits. Since we know that computing the 
parity of bits requires at least A^/2 queries [3, 11], this Hamiltonian cannot be simulated with 
o{N) queries. Moreover, we want this Hamiltonian to be non-sparse. 

We start with a simple Hamiltonian Hi whose graph is just a line with A^-|- 1 vertices. Consider 
the Hamiltonian acting on vectors \i) with i G {0, . . . , A^}. The nonzero matrix entries of Hi are 
{i\Hi\i + l) = {i + 1 \Hi\i) = .y{N-i){i + l)/N for i G {0, 1, . . . , A^ - 1}. This Hamihonian has 
\\Hi\\ = 1, and simulating Hi for t = TrN/2 starting with the state |0) gives the state |A^) (i.e., 

e-iHit^O) = \N)). 

Now, as in Ref. [4], consider a Hamiltonian H2 generated from an A^-bit string SqSi . . . S^-i. 
H acts on vertices \i,j), with i G {0, . . . , A^} and j G {0, 1}. The nonzero matrix entries of this 
Hamiltonian are 

{i,j \H2\ i + 1, j e Si) = {i + l,j e Si \H2\ i,j) = y'iN -i)ii + l)/N (16) 

for all i and j. By construction, |0,0) is connected to either |i,0) or \i, 1) for any i; it is connected 
to \i,j) if and only if j = Sq^bSiQ. . .©5i_i. Thus |0,0) is connected to either |A^,0) or |A^, 1), and 
determining which is the case determines the parity of S. The graph of this Hamiltonian consists 
of two disjoint lines, one of which contains |0, 0) and either | A^, 0) or | A^, 1) depending on the parity 
of S. Just as for Hi, starting with the state |0,0) and simulating H2 for time t = itN/2 will give 
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either |A^, 0) or |A^, 1), which determines the parity of S. Note that since H2 is a permutation of 
Hi® Hi, \\H2\\ = \\Hi\\ = 1. 

Finally, we construct the dense Hamiltonian H that has the properties stated in the theorem. 
As before, H is generated from an A^-bit string SqSi . . . Sn-i- H acts on vertices \i,j,k), with 
i G {0, . . . , A^}, j £ {0, 1}, and k G {0, A^ — 1}. The nonzero entries of H are given by 

{i,j,k\H\i + l,j®Si,k') = {i + l,j®Si,k'\H\i,j,k) = y/(N-i){i + l)/N^ (17) 

for all i, j, k, and k' . The graph of H is similar to that of H2, except that for each vertex in H2, 
there are now N copies of it in H. This Hamiltonian is dense because it has Q{N'^) vertices and 
each vertex is connected to all A^ copies of its neighboring vertices, which gives at least A^ nonzero 
entries in each row. 

Now we simulate the Hamiltonian starting from the uniform superposition over the copies of 
the |0, 0) state, i.e., from the state |0, 0, /c). The subspace span{^^ |i, j. A;)} of uniform 

superpositions over the third register is an invariant subspace of this Hamiltonian. Since the initial 
state lies in this subspace, the quantum walk remains in this subspace. In other words, the quantum 
walk on this dense graph starting from the chosen state reduces to the quantum walk on H2 starting 
from the |0, 0) state. 

Now, just as before, the parity of S can be determined by simulating H for time t = tiN/2. 
This gives the lower bound of N/2 queries. 

To calculate the norms of this Hamiltonian, we observe that H = H2 ® J/N, where J is the 
all-ones matrix of size N x N. This gives ||-fr|| = \\H2\\ ■ || J||/A^ = 1. Direct computation shows 
that max{H) = e{l/N) and mcn{H) = G(l/ViV). □ 

This theorem rules out algorithms that make only 0{max(Ht)) or 0{mcn{Ht)) queries, since 
for this Hamiltonian with t = 7tN/2 we have max{Ht) = 0(1) and mcn(i/t) = @(^/N), both of 
which are disallowed by the lower bound of Q{N). However, this does not distinguish between \\H\\ 
and ||abs(-ff)|| (since ahs{H) = H), and in fact ||-ff||i ~ 1 as well. In the next section we construct 
examples with ||absi?|| S> \\H\\ in order to show that a general simulation using 0(||fft||) steps (or 
even poly(||ift||,log A^) steps) is not possible. 

4 A stronger limitation for dense Hamiltonians 

As discussed in Section 1, there are dense Hamiltonian simulation algorithms that use 0(||abs(i?t)||) 
or 0(||i7t||i) steps of a discrete-time quantum walk. However, in light of the no-fast-forwarding 
theorem for dense Hamiltonians, it might be reasonable to hope that dense Hamiltonians can be 
simulated in 0(||//i||) steps, or at least in poly(||-fft||,log A^) steps. Indeed, if such simulations ex- 
isted they could be applied to give new quantum algorithms for various problems [7] . Unfortunately, 
in this section, we show that such simulations are not possible in general. 

The currently known dense Hamiltonian simulation algorithms rely on certain properties of 
the Hamiltonian that we call its structural properties. By this we mean the location of nonzero 
entries in H, which correspond to the location of edges in the graph of the Hamiltonian, and the 
magnitudes of the edge weights. (The remaining information about the Hamiltonian is the phase 
of each matrix entry Hij.) 

Given the structural information, the currently known algorithms can simulate dense Hamilto- 
nians using 0(||abs(-fft) II) or 0(||-fft||i) calls to an oracle that gives the phase of the matrix entry 
at Hij. We call this the matrix entry phase oracle. This oracle provides the value of Hij/\Hij\ when 
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queried with the input (The oracle may return any complex number of unit modulus — say, 

1 — when Hij =0.) 

We show that given this matrix entry phase oracle and complete structural information, there 
exist some Hamiltonians that cannot be simulated with poly(||-frt||, log A^) queries (although they 
can be simulated with 0(||abs(-fft) ||) queries [7]). The following theorem is our main result. 

Theorem 4. No quantum algorithm can simulate a general Hamiltonian H G ([^NxN time t 
with poly(||i7t||, log A^) queries to a matrix entry phase oracle, even when given complete structural 
information about the Hamiltonian. 

Proof. The proof of the theorem is divided into two parts. First we show that there exists a set of 
Hamiltonians of size N x N that is hard to simulate on average for a particular time t. Specifically, 
we show that simulating a Hamiltonian selected uniformly at random from this set for a chosen time 
has average-case query complexity r2( Y^V/logiV). Then we show that a Hamiltonian simulation 
algorithm that makes poly(||fft||, log A^) queries would violate this lower bound. The lemmas used 
in the proof are proved in the appendix. 

To show the lower bound, we need a black-box problem with an Q{y/N/\ogN) average-case 
lower bound, and a set of Hamiltonians whose simulation would solve this problem. We consider 
the problem of distinguishing strings s E { — 1, +1}^^ that have sum —B or +B, given a black box 
for the entries of the string. When queried with an index i G {1, 2, . . . , M}, the black box returns 
the value of Sj G {— 1,+1}, where s = siS2 . . . sm- The following lemma characterizes the query 
complexity of this problem. 

Lemma 2. Suppose we are given black-box access to a string s G { — where s is chosen 
uniformly at random from the set of strings with '^^Si £ { — B,+B}. Then determining Sj has 
average-case quantum query complexity Q{M/B). 

Thus, determining whether the sum is —\/M logAf or +v'MlogM, with the promise that one 
of these is the case, requires M / log M) quantum queries on average. For each string s, we 
construct a Hamiltonian Hs whose simulation for a particular time allows us to distinguish the two 
possible cases assuming s satisfies the promise. 

Let Hg be a symmetric circulant matrix of size N x N, where N = 2M + 1 is odd. (A circulant 
matrix is a matrix in which each row is rotated one element to the right relative to the preceding 
row.) In general, a circulant matrix is completely specified by its first row. However, since Hg is 
a symmetric circulant matrix, it is completely specified by the first M + 1 entries of the first row. 
Let the first entry of the first row be 0, and the next M entries of the first row be si,S2, ■ ■ ■ , sm- 
In other words, the first M + 1 entries of the first row of Hg are followed by the string s. Then 
the remaining entries of the first row are sm, sm-i, ■ ■ ■ ,si. 

Given a black box for the entries of s, we can easily construct a black box for the entries of Hg. 
Indeed, one query to H can be simulated with at most one query to the string s. Sometimes no 
query to s is needed, since the diagonal entries of Hg are always 0. 

Since Hg is a circulant matrix, it is diagonalized by the discrete Fourier transform. Its eigen- 
values Ao, Ai, . . . , Aat-i are 




and in particular 




M 



(18) 



Thus the time evolution of Hg can be used to learn whether Sj is — \/ AI log M or -\-^yMlogM. 
Since Aq = 2 sj , the two cases can be distinguished by determining the sign of Aq . Note that we 
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know the eigenvector corresponding to Aq: it is the first column of the discrete Fourier transform 
matrix, i.e., the uniform superposition over all computational basis states. 

Consider the eigenvalues and eigenvectors of the unitary matrix e~^^'^ that corresponds to evolv- 
ing H for time r = 7r/4 V^MlogM. The eigenvectors of this matrix are the same as those of H, and 
each eigenvalue of H corresponds to the eigenvalue exp (— iA^r) of e^'^"^. Thus the uniform su- 
perposition is an eigenvector of e"'^"^ with eigenvalue exp(— iAor) = exp (— ivr Si/2\/M log M) . 
Since Si/y/M log M is either ±1, the two possible eigenvalues are ±i. Because the eigenvector is 
known, the two possibilities can be easily distinguished by phase estimation on the unitary e~^^'^. 
Since the problem of distinguishing these two cases has an ^{\/ M/ log M) average-case lower bound 
by Lemma 2, we get an average-case lower bound for simulating such Hamiltonians 

for time r = n / 4:^/ M log M . 

Now we want to show that a poly(||i/i||, log A^) Hamiltonian simulation algorithm violates this 
lower bound. To do this, we need to know the typical behavior of ||-ffs|| when s satisfies the promise. 
Let S := { — and let V be the subset of strings in S that satisfy the promise. As a first 
step, let us see the behavior of for all strings s € 5, not just those that satisfy the promise. 

Lemma 3. Let Hg G M^^^ he a symmetric circulant matrix of size N = 2M + 1 with the first 
M + 1 entries of the first row given by followed by a string s £ S. If s is chosen uniformly at 
random from S, denoted s Gr <S, then for any d > 0, 

In fact, even stronger results of this kind are known [15,20], but the above bound is easy to 
prove and sufficient for our purposes. Using Lemma 3, we wish to bound the spectral norm of Hs 
when s Gil V. We can do so by first calculating the probability that a randomly selected string 
satisfies the promise. 

Lemma 4. If s Gr S, then the probability that s satisfies the promise is 

Pr (s G = e(l/M). (20) 

s6r5 

Using Lemmas 3 and 4, we can upper bound the probability that is large when s is chosen 
uniformly at random from V. If X is the event that \\IIs\\ > 4:d\/ M log M and Y is the event that 
s £ V, then Pr(X) is given by Lemma 3 and Pt{Y) is given by Lemma 4. In these terms, we can 
compute an upper bound for Pr(X|y) as follows: 

Pr^ > 4dyMbiM) = FriX\Y) = ^^^^ < ^ = 0{M^~^''). (21) 

To achieve a contradiction, assume that for some constant c > 0, there exists a Hamiltonian 
simulation algorithm using O {{\\Ht\\ logM)'^) queries to simulate H for time t. By (21), we know 
that Hg almost always has spectral norm smaller than 4d\/ M log M when s Gr V. Since < 
= 2M, we can compute the average-case query complexity of the claimed algorithm as 

follows: 

E {\\Ht\\'')< Pr (\\Hs\\ < M^/M log m) O ( [M^jM log M , ^ J" ^ ^ log M 

sGbV «6rP V ^ V V 4^/M log AI 

+ Fr^ {\m > My^m^) O (((2M)^^^^^ ^oguj) (22) 
< OiidlogMY) + O (^M'/^-^^'+\logMy/^^ , (23) 
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and by choosing 2(P > c/2 + 2, we have 

E {\\Ht\n = {{log MY). (24) 

Thus the average-case query complexity of the claimed algorithm is 0((logM)'^), which violates 
the lower bound of n{^/MJlogM). □ 

The proof technique above can be extended to rule out algorithms with query complexity sub- 
exponential in (||i?f||,log A^) as well, by changing the promised set (i.e., the value of B used in 
Lemma 2) and choosing a larger value of d in Lemma 3. Exponential functions of (||fft||, log A^) 
cannot be ruled out, of course, since any Hamiltonian can be simulated by making 0{N^) queries, 
which is exponential in logA^. On the other hand, if we insist that the query complexity of an 
algorithm depends only on \\Ht\\ (and not log A^), then the proof above can be modified to rule out 
algorithms whose time complexity is an arbitrary function of For example, there exists no 

Hamiltonian simulation algorithm that makes exp(exp(||fft||)) queries. 

Finally, we emphasize that even though the above proof involves average-case complexity and 
distributions over inputs, Theorem 4 is a statement about the worst-case complexity of simulating 
Hamiltonians. 



5 Simulation complexity for structured Hamiltonians 

As the previous section shows, we cannot hope for general Hamiltonian simulation algorithms that 
scale polynomially in the spectral norm of the Hamiltonian. Although we do know algorithms that 
scale like 0(||abs(i?)t||), Lemma 1 tells us that ||abs(-ff)|| could be exponentially larger than \\H\\. 
However, we can achieve better scaling for special classes of Hamiltonians. For example, we saw in 
Section 2 that much stronger bounds hold for sparse Hamiltonians. 

We can also improve the inequalities of Lemma 1 for certain classes of non-sparse Hamiltonians. 
For example, consider the class of Hamiltonians whose graphs are trees (where the graph of a 
matrix refers to the graph of its nonzero entries). Such Hamiltonians can be efficiently simulated 
even when they are not sparse: in this case, Theorem 2 gives a simulation using 0(||//i||) steps of 
a discrete-time quantum walk, because when the graph of -fT is a tree, ||abs(-fr)|| = \\H\\. 

Proposition 1. // the graph of a Hermitian matrix H is a tree, then there exists a unitary matrix 
U such that UHU'^ = abs{H). In particular, ||abs(-ff)|| = \\H\\. 

Proof. The matrix U is diagonal. To define Ua, we arbitrarily fix some vertex as the root and con- 
sider the unique path from the root to vertex i. Let the path contain the vertices iQ,ii, . . . , ip-i,ip, i, 
where io is the root and ip is the parent of i. For each nonzero entry of H, define aij := Hij/\Hij\. 
Then let Ua := 1 if i is the root and Uu := aigi^^ai^i^ ■ ■ ■ ai^_^ij,ai^^i otherwise. 

Since U is diagonal, {UHU'^)ij = UuHijUjj. If i and j are not adjacent in the tree, then 
{UHU^)ij = Hij = as required. Otherwise, suppose without loss of generality that j is the parent 
of i. Then UuUj^ = |aion P|ani2 1^ • • • = Hji/\Hij\, so {UHU^)ij = \Hij\ as claimed. □ 

Thus Theorem 2 gives a simulation using 0(||//t||) steps of a discrete-time quantum walk. 
However, there is another simulation method for such Hamiltonians that uses only mcn(i^i)^'^°(^) 
steps [7, Theorem 4]. It seems from Lemma 1 that the former method might be inferior, but due 
to Proposition 2 below it is, in fact, superior to the mcn{Ht)^~^°^^^ simulation (except with respect 
to error scaling), since \\Ht\\ < 2mcn{Ht) when the graph of the Hamiltonian H is a tree. 
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If H can be expressed as the sum of a small number of Hamiltonians, each of whose graph is 
a forest, then H can be efficiently simulated when \\H\\ is small. Recall that a graph is said to 
have arboricity k if its adjacency matrix can be written as the sum of the adjacency matrices of k 
forests, but not k — 1 forests. 

Proposition 2. // the graph of a Hamiltonian H has arboricity k, then \\ahs{H)\\ < 2k mcn{H) . 
Moreover, ||abs(ii')|| < 2k\\H\\ and \\H\\ < 2kincn{H). 

Proof. We begin by considering the case of a star graph. A star graph is a tree on n vertices with 
one vertex having degree n — 1 and the others having degree 1 (i.e., the complete bipartite graph 
Ki^n-i)- We show that if 5 is a Hamiltonian whose graph is a star, 

mcn(5) = ||5|| = ||abs(5)||. (25) 

By permuting the vertices, the first vertex can be chosen to be the one with maximum degree. 
Now the first column of the matrix S completely determines the Hamiltonian. Let the first column 
be w. The matrix S has first column w and first row . It is easy to see that mcn(5) = \\w\\. S 
has exactly two nonzero eigenvalues, ib||7i;||, corresponding to the eigenvectors ||w||ei zizw, where ei 
is the first column of the identity matrix. Since \\S\\ is the maximum eigenvalue, \\S\\ = mcn(S'). 

Moreover, since abs(S') is a Hamiltonian whose graph is a star, we have ||abs(5)|| = mcn(abs(5)). 
For any matrix H, mcn(abs(i?)) = mcn(H), since the norms of the columns depend only on the 
magnitude of each entry. This proves the desired result, mcn(S') = \\S\\ = ||abs(S')||. These results 
also hold for a disjoint union of star graphs (a forest of stars), since the above norms all have the 
property that i^{Ai © ... © A„) = max(i/(y4i), . . . , i'{An)). 

To show the result for graphs of arboricity k, we begin by showing how a rooted tree can be 
decomposed into the sum of two forests of stars. The first forest contains all the edges in which the 
parent vertex is at a even distance from the root. The second forest contains the rest of the edges. 
This decomposes a rooted tree into two forests of stars, and similarly decomposes a forest into two 
forests of stars. Since the Hamiltonian has arboricity k, it can be decomposed into k forests, which 
can be decomposed into 2k forests of stars. 

Thus H = Yl'i=o where each of the Si is a Hermitian matrix whose graph is a forest of stars. 
Moreover, the matrices Si have no overlapping edges, i.e., if {Si)ij / for some I, then {Si)ij = 
for all other I. Therefore, for all i,j,l, Hij > {Si)ij, which implies mcn(H) > mcn(S'/) for all /. This 
gives 

mcn(F) > ^ E ™('^') = ^ E ll-^HI- (26) 
I I 

Using the triangle inequality, we find 

||abs(i/)||<5^||abs(5z)||=5^||5z||. (27) 
I I 

Combining (26) and (27) gives the main result. Using Lemma 1 and the main result gives 
||abs(F)|| < 2k\\H\\ and \\H\\ < 2kmcn{H). □ 



6 Open questions 

Although we have ruled out the possibility of a generic Hamiltonian simulation algorithm using only 
poly(||fft||, log N) operations, we can nevertheless hope that some nontrivial classes of Hamiltonians 
can be simulated in poly(||-fft||,log A^) steps even though \\H\\ <C ||abs(if)||. 



11 



One approach is to consider changing the basis in which the Hamiltonian is simulated. Clearly, if 
unitary transformations U and can be performed efficiently, then H can be simulated efficiently 
if and only if UHU^ can. There must exist bases in which UHU^ is sparse (such as the basis in which 
it is diagonal), which may lead to efficient simulations of H. Some trivial classes of Hamiltonians 
can be simulated in this way, such as Hamiltonians that are tensor products of small factors. For 
example, the Hamiltonian i?®", where R ■= ({ Ji) /V^ is the Hadamard matrix, has H-R*®"!! = 1 
and Hi?*®"!!! = ||abs(i?®"')|| = 2"/^, yet the evolution according to i?®" is easy to simulate. A 
similar simulation for a case where the Hamiltonian is not a tensor product was used in Ref . [9] . 

An alternative method is to investigate ways of decomposing a Hamiltonian as a sum of Hamil- 
tonians that can be efficiently simulated. For example, we can simulate Hamiltonians whose graphs 
have polynomial arboricity by decomposing them into stars (although we saw in Proposition 2 
that such Hamiltonians can already by simulated efficiently by the method of Ref. [7] since they 
satisfy ||abs(-ff)|| = poly(||i/||)). More generally, other graph decompositions could give rise to new 
efficient simulations. 

Another interesting problem is to find classes of Hamiltonians that can be simulated in sublinear 
time. These correspond to quantum systems whose time evolution can be fast-forwarded. Some 
Hamiltonians may even be simulated in constant time, if e~^^'^ = I after some constant time r (for 
example, the case i?®" mentioned above has r = 27r). 

Acknowledgments 

We thank Aram Harrow for suggesting the use of Hoeffding's inequality to simply the proof of 
Lemma 3. This work was supported by MITACS, NSERC, QuantumWorks, and the US ARO/DTO. 

Appendix: Proofs of lemmas 

In this appendix, we prove Lemmas 2, 3, and 4. 

Lemma 2. Suppose we are given black-box access to a string s G { — where s is chosen 
uniformly at random from the set of strings with Sj G { — B,+B}. Then determining ^^Si has 
average-case quantum query complexity Q{M/B). 

Proof. We show the lower bound by first showing the same lower bound for the worst-case problem 
using the quantum adversary method [2] and then reducing the worst-case problem to the average- 
case problem. 

For the worst-case lower bound, we use the notation of Theorem 2 of Ref. [2]. We require two 
sets of inputs X and Y that have different outputs. Let X be the set of all strings for which 

Sj = —B, and Y be the set for which Sj = +B. We define a relation between the sets as 
follows. Let an element x £ X he related to an element of y S y if and only if y can be reached 
from X by changing exactly B/2 —Is to -|-ls in the string x. Note that a string in X has exactly 
(M/2 -\- B/2) -Is and (M/2 - B/2) +ls. 

Using these sets X and Y, and the relation defined above, it is easy to see that 




and 




(28) 



so 




(29) 
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The adversary method now provides a lower bound of 0, (^y^ mm' / 11'^ = Q.{M/B) for the worst-case 
query complexity of this problem. 

The worst-case query complexity can now be reduced to the average-case query complexity 
under the uniform distribution over all input strings satisfying the promise. To do this, we first 
apply a uniformly random permutation to the input string, and then with probability ^ multiply 
all the entries by —1 (and leave them unchanged with probability ^). The resulting distribution is 
now uniform over all input strings satisfying the promise. If the string is not multiplied by —1, then 
the output of the permuted string is the same as the input string. If all the entries are multiplied 
by —1, then the output of the modified string is the opposite of that of the original input. 

The lower bound is tight due to a matching upper bound provided by the algorithm for approx- 
imate quantum counting [5]. To distinguish the two types of inputs, we can approximately count 
the number of +ls to accuracy e = B/2M, which requires 0{M/B) queries. □ 

Lemma 3. Let Hg G R^^^ be a symmetric circulant matrix of size N = 2M + 1 with the first 
M + 1 entries of the first row given by followed by a string s € S. If s is chosen uniformly at 
random from S, denoted s Er S, then for any d > 0, 

pr^ {m\ > id,/m^) < i±^. (30) 

Proof. The eigenvalues of Hg are = 2 J2jLi '^j cos where rG{0,l,...,A^ — 1}. We wish to 
bound the probability that Xr is large, so as to bound the probability of ||-ffs|| = max^ \ Xr\ being 
large. This is achieved by applying Hoeffding's inequality [16, Theorem 2]. 

Theorem 5 (Hoeffding's inequality). If Xi,X2, ■ ■ ■ ,Xm are independent and aj < Xj < bj for all 

1 ^ J ^ M, then for any t > 0, we have 

( -2MH^ \ 
Pr (X - E (X) > Mt) < exp (31) 

where X = YljLi ■ 

If we take Xj = 2s j cos then X = Xj- = 2 X^jli Sj cos each Xj is between —2 and +2, 
and E {X) = ^jL-^ E (Xj) = 0. By choosing t = Ady/\og M/M, we get 

^ f. nnr. — V7\ f -2 16d'^ log M/M)\ 1 

Pr [Xr > 4dx/MlogM) < exp ^^m^^^ J = J^' ^^2) 

Since a similar inequality holds when Xj is replaced by —Xj, we get 



Pr (|A,| > M^MlogMj < (33) 
Finally, since \\IIs\\ = max^ |Ar|, a union bound gives 

Pr [\\Hs\\ > Ady^M log m) < (34) 

which implies the desired result. □ 
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Lemma 4. If s S, then the probability that s satisfies the promise is 

Pr (s G = e(l/M). (35) 

s£rS 

Proof. Of the 2^ strings of length M, those with sum —\JM log M or +-^Mlog M have either 
\{M + \JM log Af ) +ls or ^(M + -^M log M) —Is. Thus the total number of such strings is 

M \ 

M+^AI log A/ I • (36) 
2 / 

We can asymptotically approximate this expression using a well-known approximation for the 
binomial coefficients (see for example equations 4.5 and 4.10 of Ref. [19]), which states that 



^kj y/TTn/2 
provided \k — n/2\ = o{n'^/^). Applying this to (36), we get 



n\ 2"exp(-2(fc-n/2)Vn) 

r = (37) 



^(^ M+^T^ ) = e (2^exp(-logM/2)/^/M) = e(2^/M), 



(38) 



2 

which proves the claim. □ 
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